One and two side generalisations of the log-Normal distribution 
by means of a new product definition 
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In this manuscript we introduce a generalisation of the log-Normal distribution 
that is inspired by a modification of the Kaypten multiplicative process using the 
(/-product of Borges [Physica A 340, 95 (2004)]. Depending on the value of q the 
distribution increases the tail for small (when q < 1) or large (when q > 1) values 
of the variable upon analysis. The usual log-Normal distribution is retrieved when 
(? = 1. The main statistical features of this distribution are presented as well as a re- 
lated random number generators and tables of quantiles of the Kolmogorov-Smirnov. 
Lastly, we illustrate the application of this distribution studying the adjustment of 
a set of variables of biological and financial origin. 
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I. INTRODUCTION 

The two-parameter log-Normal distribution, with probability density function, 



p (x) 



1 



Itxox 



exp 



(In a; — fi)" 



x > 0, 



(1) 



has played a major role in the statistical characterisation of many data sets for several 
decades (empirical fitting) and has been an inspiration for theoretical studies as well. The 
form of E q. (fl~l) has been derived in several ways with particularl y emphasis to the works of 



Kapteyn 



Kapteyn. 19031 ] . the Gibrat's la w of proportionate effect jGibrat. 19301 ] . the Theory 



of Breakage introduce d by Kolm ogorov Kolmogorov. 19411 ] or more recently in the theory 
of chemical reactions Fa. 20031 ] . Concomitantly, Eq. (OQ) has been systematically modified 
to cope with different sets of data. Of those generalisations the most famous of them is the 



2 



truncated log-normal distribution, 

p (x) = 



2no~ (x — 7) 



exp 



(In (x -7) - fJ-Y 
2a 2 



< 7 < x, 



(2) 



which has become a mathematical object of study in itself. 

In this manuscript we introduce an alternative generalisation of the log-normal distri- 
bution which we will term the q-log Normal distribution for historial reasons. This pur- 
ported probability density function emerges from changing the traditional algebra in the 
Kaypte n dynamics b y a modified multipl ication operation in dependently introduced by 
Borges Borges. 2004| and Nivanen et al. Nivanen et ai. 20031 ] . This algebra has got di- 
rect consequences on the emergence of asymptotic scale-free behaviour. Specifically, in this 
manuscript we survey a new family of probability density functions based on, 

\2" 



1 



Pq (X) 



Z q xi 



exp 



Og x - 
2 a 2 



[x > 0) , 



(3) 



where ln q (x) represents a generalisation of the logarithm of base e with the normalisation, 



^/ferfc 
yferfc 



y/2cr \ 1-q 



a if q < 1 
a if q > 1. 



(4) 



where 



erfc(x) = 2$(v / 2 



x 



and which in the limit q — >• l^ 1 exactly gives the traditional log-Normal distribution. The 
aim of the present work is to introduce the functional form of the distribution, its dynamical 
origins and statistical features as well as applying it to data of biological and financial origin. 
The manuscript is organised as follows. In Sec. [IT] we give a historical and mathematical 
introduction of the underlying algebra; In Sec. II III we reinterpret the Kaypten scenario for 
the emergence of the log- normal, but using the g-algebra formalism which lead to the q < 1, 
q > 1, and double q- log-Normal probability density function. In Sees. IIVI and IVl we analyse 
their statistical properties and generate random variables according to the distribution. 
Finally, in Sec. IVII we introduce some real examples to which the new distribution is shown 
to be a worthy candidate for medelling the data. 
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II. PRELIMINARIES: THE (/-PRODUCT 



The g-product, <g> 9 , has its origins in the endeavour to extend the subject of statisti- 
cal mechanics to systems exhibiting anomalous behaviour when compared to systems de- 
scribed at Boltzmann-Gibbs equilibrium, i.e., to deal with systems presenting long-lasting 
correlations, ageing phenomena, non-exponential sensitivity to initial conditions, and scale- 
inv ariance occupancy of the allowe d phase space (for detailed explanation of these concepts 



sec 



Tsallis. 20091 . lAbe et al. 2007] )■ The proposed extension of statistical mechanics theory 



is grounded on the entropic functional 

1 — f [p (x)] q dx 



Sn 



q-l 



(q G 



(5) 



(in its continuou s and one-dimensional version) usually called Tsallis entropy as 
well Tsallis. 19881 ] . This entropic form recovers the celebrated Boltzmann-Gibbs-Shannon 
information measure, 

S = — J p (x) In (x) dx, (6) 

in the limit that the entropic parameter q approaches 1. The interpretation of Eq. (j3J) as a 
q generalisation of Eq. ([6]) induced the introduction of analogue functions of the exponential 
and the logarithm, namely, the q-exponential 



exp (x) = [1 + (1 - q) x] 



1-9 



[x,q E 



(?) 



(exp (x) — if 1 + (1 — q) x < 0) and its inverse the q-logarithm Tsallis. 1994| . 



ln ? (x) 



,1-q _ Y 

1-9 



(x > 0,q e 



(8) 



A functional form that generalises the mathematical identity, 

exp [In x + In y] = x x y, (x, y > 0) 

for the g-product is, 

x® q y = exp g [ln 9 x + \n q y] . 
For q — > 1, Eq. fflOl) recovers the usual property, 



(9) 



(10) 



In (x X y) = In x + In y 
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(x, y > 0), with x x y = x ®\ y. Its inverse operation, the g-division, x ® q y, satisfies the 
following equality (x ® q y) <2) q y — x. 

Bearing in mind that the g-exponential is a non-negative function, the g-product must 
be restricted to the values of x and y that respect the condition, 

\x\ x - q + \y\ 1 ~ q -1>0. (11) 

Moreover, we can extend the domain of the g-product to negative values of x and y writing 
it as, 

x® q y = sign (xy) exp q [In, \x\ + ln 9 \y\] . (12) 
Regarding some key properties of the g-product we mention: 

1. x ®i y = x y; 

2. x ® q y = y ® q x] 

3. (x ® q y)® q z = x ® q (y ® q z) = [x l ~ q + y 1 - q -2]^; 

4. (x ® q 1) = x; 

5. ln 9 [x ® q y] = ln 9 x + ln g y; 

6. \n q (xy) = \n q (x) + ln q (y) + (1 - g) ln g (x) ln g (y); 

7. (x^gyy 1 = x~ l (g) 2 - 9 



(X ®q 0) 



if (g > 1 and x > 0) or if (g < 1 and < x < 1] 



i 



1 9 — l) 1 " 9 otherwise 



For particular values of g, e.g., q = 1/2, the g-product provides nonnegative values at 
points for which the inequality + \y\ l q — 1 < is verified. According to the cut-off 

of the g-exponential, a value of zero for x ® q y is set down in these cases. Restraining our 
analysis of Eq. pip to the sub-space x,y > 0, we can observe that for q — > —oo the region 
{0 < x < 1, < y < 1} is not defined. As the value of q increases, the forbidden region 
decreases its area, and when q = 0, we have the limiting line given by x + y = 1, for which 
x ®o y = 0. Only for q — 1, the entire set of x and |/ real values of has a defined value for 



5 



l-q 



+ 



1 1-3 



1 for which 



the g-product. For q > 1, the condition (fTTj) implies a region, \x 
the g-product diverges. This undefined region augments its area as q goes to infinity. When 
q — oo, the g-product is only defi ned in {x > 0, < y < 1} u£0 < x < 1, y > 1}. Illustrative 
plots are presented in Fig. (1) of [Duarte Queiros and Tsallis.2007 ]. 

From the properties presented above we ascertain that the g-product has got a neutral el- 
ement and opposite and inverse elements under restrictions. However, distributive property 
is not held and this fact thwarts the g-product of having commutative ring or field struc- 
tures. Nevertheless, it does not di minish the importance of this algebraic structure as other 



algebras like the tropical algebra Clay. 1992| do not present all the standard algebra prop- 



erties and because the q-pro duct represents a quite rare case of a both-side non-distributive 
structure [Green. 1948 ]. 



Besides its inherent exquisiteness, this generali sation has found its own fi eld of applica- 
bility in the definition of the q- Fourier transform Umarov and Tsallis. 2008 1 which plays a 
key part in non- linear generalisations of the g-Central Limit Theorem Umarov et ai. 20061 ] , 
the definition of a modified charac teristics methods which allows the f ull analytical solution 



of the porous medium e quations 



Pascal-Leibniz triangles 



Umarov and Duarte Queiros. 20081 ] and the structure of 



Petit Lobao et ai. 200 



91- 



III. MULTIPLICATIVE PROCESSES AS GENERATORS OF DISTRIBUTIONS 



Multiplicative processes, particularly stochastic multiplicative processes, have been the 
source of plentiful models applied in several fields of science an d knowled ge. In this con- 



text, we can name the s tudy of flui d turbulence 



Frisch 199711 . fractals [Feder. 19881 ]. fi- 



nance [Mandelbrot. 19971 ] . linguistics [Stauffer et ai. 20061 ] . etc. Specifically, multiplicative 
processes play a very important role in the emergence of the log-Normal distribution as a 
natural and ubiquitous distribution. With regard to the dynamical origins of the log-Normal 
distribution, we have mentioned in Sec. [J the most celebrated examples. Now, we shall give 
a brief account of the Kapteyn's process. To that, let us consider a variable Z obtained from 
a multiplicative random process, 

N 

Z = l[Ci, (13) 



8=1 



6 



where Q are nonnegative microscopic variables associated with a distribution /'(C)- If we 
consider the following change of variables Z = In Z, then we have, 

N 



i=i 



with ( = ln(. Assume now that ( has a distribution / (() with mean /x and variance a 2 . 
Then, Z converges to the Gaussi an distribution in the lim it of N going to infinity as entailed 
by the Central Limit Theorem Araujo and Guine. 1980( |. Explicitly, considering that the 
variables ( are independently and identically distributed, the Fourier Transform of p {Z') is 
given by, 



e tk -f(C) d( 



N 



T\p{Z')) (k) = 

where Z' = N~ l Z. For all N, the integrand can be expanded as, 



(14) 



T\p{Z')\ (k) 



,n=0 



m n (c"> 

! N 



(15) 



F\p{Z')\ (k) = exp {Nln 



l + ik 



(0 

N 



fc 2 ^ + 0(iV- 3 ) 



1 7.2 
2 



where (( n ) represents the nth order raw moment of (. Expanding the logarithm, 

1 



T [P (Z')] (k) w exp 



iku k 2 a 2 

P 2N 



(16) 



Applying the inverse Fourier Transform, and reverting the Z' change of variables we finally 
obtain, 



P(Z) 



1 



V2nNa 



exp 



[Z-Nnf 
2a 2 N 



(17) 



We can define the attracting distribution in terms of the original multiplicativ e random 



process which yields the usual log-Normal distribution 



Crow and Shimizu. 1988] . 



viz) 



V2nNaZ 



— exp 



(inZ-Nfj) 
2 a 2 N 



21 



(18) 



Although this distribution with two parameters, /i and a, is able to appropriately de- 
scribe a large variety o f data sets, there are cas es for which the log-Normal distribution 
fails statistical testing Crow and Shimizu. 19 88]. In some of these cases, such a failure 



7 



has bee n overcome by introducing different statistical distributi ons (e.g., Weibull distri 



butions 



Frechet. 19271 . 



Rosin and Rammler. 1933 



Weibull. 19511 ]) or by ch anging the 2 



parameter log - Normal distribution into a 3-parameter log-Normal distribution 
Finney. 19411 ] . 

(In [x-9\- fi) 2 ' 



Yuang. 19331 . 



p [x) 



2 7TO" [X 



exp 



2<X 2 



(19) 



which is very well characterised in the current scientific literature Cohen. 1988 ]. 



A. One side generalisations 



Moving ahead, we now present our alternative procedure to generalise the distribution in 
Eq. (TjQ). The motivation for this proposal comes from changing the N products in Eq. (fl3l) 
by iV g-products, 



N (?) 

z = H Ci = Cl ®« G ®g 



\ Cn 



(20) 



1=1 



Applying the g-logarithm we have a sum of N terms. If every term is identically and 
independently distributed, then for variables Q = ln q Q with finite variables we have a 
Gaussian has stable distribution [1] , i.e., a Gaussian distribution in the g-logarithm variable. 
From this scenario we can obtain our q-log Normal probability density function, 



1 



Pq W 



Z q xi 



cxp 



(lng x-n)' 
2 a 2 



with the normalisation, 



Z„ 



^ferfc 



i / i 



|erfc 



yfia \^-q 



1 / 1 



{x > 0) , 

a if q<l 
a if q > 1. 



(21) 



(22) 



y2<r \l-q + V ■ 

In the limit of q equal to 1, ln^ia; = In a; and Z q ^\ = \J1 tco~ and the usual log-Normal is 
recovered. The cumulative distribution, 



V{x) 

is given by the following expressions, 

V q> i (x) -- 



p (z) dz, 



1 + erf 




1 + erf 


q-l 1" 


V2a 



8 



and, 



Va<l (X) 



erf 


]n q (x)— \X 
Via 




- erf 


Via \l-q 




1 + erfc 


[ vi(iV^)] 





Typical plots for cases with g = |,g = l,g=| are depicted in Fig. [TJ It can be seen 
that for q greater than one the likelihood of events round the peak as well as large values 
is greater than that for the log-Normal case whereas the case q < 1 favours events of small 
value and the intermediate regime between the peak and the tail. 




0.001 0.01 0.1 



10 100 1000 



FIG. 1: Plots of the Eq. vs x for q = f (dotted line), q = 1 (full line) and q = § (dashed line) 
in linear-linear scale (upper), log-linear (right), log-log (lower). 



The raw statistical moments, 



(x n ) = I x n p(x) dx, 
'o 



(23) 



can be analytically computed for q < 1 giving 

(x n ) 



Gradshteyn and Ryzhik. 1 980] 



T [u] exp 


7 2 " 
8/3_ 




7 




y/P u na(l - q) erfc 







(24) 



9 



with 



P 



2a 2 (1 - q) 



2 ' 



7 



1+M(l~g) , 
(1 T g)' ^ ! 



1 



1-?' 



(25) 



where D_ a [z] is the parabolic cylinder function Wolfram. 20011 ] . Equation ( J24l) allows us to 
write the Fourier Transform or the generating function as, 

<p{k) = 



f p (x) e lkx dx 



E 



T[i>n\ exp 


7 2 " 

8/3 




7r <r(l— g)erfc 


-^(xV^)J 



(26) 



{ik) n . 



For g > 1, the raw moments are given by an expression quite similar to the Eq. (1211) with 
the argument of the erfc replaced by 



1 



1 



V2a \l~q 

However, the finiteness of the raw moments is not guaranteed for every q > 1 for two reasons. 
First, according to the definition of D_ v [z], v must be greater than 0. Second, the core of 
the probability density function (I2~T|) . 



cxp 



(in 9 x — i^y 

2o~ 2 



does not vanish in the limit of x going to infinity, 



lim exp 



(lng x - n) A 
2a 2 









" 7 2 " 


= exp 


2 _ 



(27) 



This means that the limit p (x — > oo) = is introduced by the normalisation factor x q , 
which comes from redefining the Normal distribution of variables, 



V = In? x, 



(28) 



as the probability density function of variables x. Because of that, if the moment exceeds 
the value of q, then the integral (123]) diverges. This has got severe repercussion in the 
adjustment procedures that can be applied. 



B. Two side generalisation 



As visible from Fig. (pQ), our generalisation modifies the tail behaviour for small and large 
values of the variable depending on the value of q which describes the dependence between 
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the variable that is transferred into the g-product. It is well-known that many processes 
are actually defined by a mixture of different laws of formation, some simpler than others. 
Within this context, dual relations namely, 

q = 2-q, 



wherefrom property 7 of the g-product (see Sec. [TTJ) emerges, are very inviting in the way 
that they represent the mapping of a certain rule onto another which seems to be different 
at first but for which there is actually a univocal transformation. Accordingly, we can 
imagine a scenario in which variables follow two distinct paths either q- multiplying or (2— q)- 
multiplying (corresponding to the inverse of the g-product) according to some proportions 
/ and f = 1 — f. This proposal is in fact quite plausible if we bear in mind few of the rife 
examples of mixing in dynamical processes. From that, we establish the law, 

Pq,2-q (X) = fPg (x) + fp 2 - q (x) , (29) 

for which we hold that / = /' = | is the most paradigmatic case. 



C. Alternative interpretation 



The g-log-Normal distribution can introduce another clear advantage. Namely, it pro- 
vides us_jvitJi_jm_j^ dynamical interpretation of the truncated Normal distribu- 



tion 



Johnson and Lotz. 197o| . In other words, we can look at the left (right) truncated 



Normal distribution, 



Ob (y) = \ erfc 

7r a 



in which the truncation factor, 



sgn (6) —f=- fji - b) 



y/2, 



a 



exp 



(y - 

2 a 2 



1 



(30) 



(31) 



and y = hi q x are intimately related to the value of q which controls the g-product part of 
the dynamical process. In this case the Fourier Transform can be analytically determined. 
For left truncations we obtain, 



^ m (y) 



1 + erf 



erfc [B] 



exp 



-~k (2in + ka 2 ) 



(32) 



11 



where erf(x) = — 1. For right-truncations, 



with 



? m (y) 



erfc 



erfc [-B] 



B 



exp 



b — fi 
V2a ' 



-k (2ifi + ka 2 ) 



(33) 



(34) 



IV. EXAMPLES OF CASCADE GENERATORS 

In this section, we discuss the upshot of two simple cases in which the dynamical process 
described in the previous section is applied. We are going to verify that the value of q 
influences the nature of the attractor in probability space. 

A. Compact distribution [0, b] 

First, let us consider a compact distribution for identically and independently distributed 
variables x within the interval and b. Following what we have described in the preced- 
ing section, we can transform our generalised multiplicative process into a simple additive 
process of yi variables which are now distributed in conformity with the distribution, 



v (y) = r [i + (i - q) y] 1-9 



(35) 



with y defined between and b ^ 1 if q < 1, whereas y ranges over the interval between 



-oo and 



ft 1 -?-! 
1-9 



when q > 1. Some curves for the special case 6 = 2 are plotted in Fig. [2j 



If we look at the variance of this independent variable, 



a,, 



(36) 



which is the moment whose finiteness plays the leading role in the Central Limit Theory, we 
verify that for q > |, we obtain a divergent value, 



b 2-2 Q 



(7 „ 



(3-2g) (2-g) 



2 ' 



(37) 



Hence, if q < |, we can apply the Lyapunov's central Limit theorem and our attractor in 
the probability space is the Gaussian distribution. On the oth er hand, if q > |, the Levy- 



Gnedenko's version of the central limit theorem 



Levy. 1954j asserts that the attracting 
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FIG. 2: Plots of the Eq. ([35]) vs y for b = 2 and the values of q presented in the text, 
distribution is a Levy distribution with a tail exponent, 



Furthermore, it is simple to verify that the interval (|, oo) of q values maps onto the interval 
(0, 2) of a values, which is precisely the interval of validity of the Levy class of distributions 
that is defined by its Fourier transform, 

F[L a ] (k) = exp[-a \k\ a ] . (39) 

In Fig. [3] we depict some sets generated by this process for different values of q. 

B. g-log Normal distribution 

In this example, we consider the case of generalised multiplicative processes in which the 
variables follow a q-log Normal distribution. In agreement with what we have referred to in 
Sec. IHIt the outcome strongly depends on the value of q. Consequently, in the associated 
x space, if we apply the generalised process to N variables y = ln g x (x 6 [0, oo)) which 
follows a Gaussian-like functional form [2J with average /x and finite standard deviation a, 
i.e., Vq<i or q > 3 in Eq.( l2T|) . the resulting distribution in the limit of iV going to infinity 
corresponds to the probability density function (1211) with \i — > N n and a 2 — > Na 2 . In 
respect of the conditions of q we have just mentioned here above, the q-log normal can be 
seen as an asymptotic attractor, a stable attractor for q = 1, and an unstable distribution for 
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FIG. 3: Sets of random variables generated from the process (I20D with N = 100 and q = — 5 
(green), (red), 5 (blue), 1 (black), | (magenta) in linear (upper panel) and log scales (lower 
panel). The generating variable is uniformly distributed within the interval [0, 1] as is the same for 
all of the cases that we present. As visible, the value of q deeply affects the values of Xn = Z. 

the remaining cases with the resulting attracting distribution being computed by applying 
the convolution operation. 



V. RANDOM NUMBER GENERATION AND TESTING 



The generation of random numbers is by itself a subject of study and depending on 
distribution different kinds of strategies can be used which start from the shrewd von 
Neumann-Buffon a ccept ance-rej e ction method von Neumann. 195l| and go to more sophis- 



ticated techniques 



Gentle. 2004j |. Since we aim to introduce a global portrait of the dis- 



tribution we have not tried to develop bespoken algorithms but applied the robust method 
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of the Smirnov transformation (or inverse transformation sampling). From a classic robust 
generator of uniform numbers, {z}, between —1 and 1 and considering the probability con- 
servation when z is transformed into y where y is associated with a truncated log Normal 
distribution with parameters /i and a and b given by Eq. ( 13~TT) . For q < 1, i.e., for y = \n q x 
between b and +oo we must use, 



y 



fi + v^crerf _1 



1, 



1) erfc [5] 



whereas for g > 1, i.e., for ?/ = ln„ x between — oo and b 



y 



fi + v^crerf _1 



0,-{z(erf [B] + l) + erf [fi] - 1} 



(40) 



(41) 



From these formulae we have defined the Kolmogorov- Smirnov distance tables that we 
present for typical cases q = 4/5 and q = 5/4 with /x = and a = 1. For each case 
10 6 samples have been considered. 

VI. EXAMPLES OF APPLICABILITY 



In the following examples parameter estimation has been made using trad itional maxi- 
mum log-likelihood methods. In spite of using Brent's method for optimisation Brent. 1973 ] 
of the log-likelihood function, the following set of equations can be solved if a differential 
method is preferred: 



£4lnP(*0=0 

i=i 
n 

£^mP(x,) = . 

i=i 
n 

E^lnP(x,) = 

i=i 

The specific equations can be obtained after straightforward (and tedious) calculus. 



(42) 



A. Shadow prices in metabolic networks 



The re presentation o f metabolic networks is often related to linear programming ap- 
proaches Palsson. 20061 ] for which there is a dual optimisation procedure. In other words, 
the maximisation of the reaction fluxes of a metabolic network with a given stoichiometry 
matrix has as its dual solution the minimisation of a certain function defined by quantities 
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TABLE I: Quantiles of the Kolmogorov-Smirnov statistics of a q-log-Normal distribution with 
q = 4/5, fx = and a = 1. 



n \ 


P 0.80 


0.85 


0.90 


0.95 


0.99 


5 


0.442 


0.471 


0.504 


0.558 


0.663 


10 


0.318 


0.339 


0.362 


0.404 


0.485 


15 


0.261 


0.277 


0.296 


0.333 


0.399 


20 


0.211 


0.225 


0.245 


0.274 


0.334 


25 


0.192 


0.205 


0.222 


0.249 


0.302 


30 


0.176 


0.188 


0.204 


0.228 


0.278 


35 


0.164 


0.175 


0.190 


0.212 


0.257 


40 


0.154 


0.165 


0.178 


0.200 


0.242 


45 


0.146 


0.156 


0.169 


0.189 


0.230 


50 


0.140 


0.149 


0.161 


0.18 


0.219 


60 


0.128 


0.139 


0.148 


0.165 


0.199 


70 


0.119 


0.127 


0.138 


0.154 


0.186 


80 


0.112 


0.121 


0.1301 


0.144 


0.175 


90 


0.106 


0.113 


0.122 


0.135 


0.164 


100 


0.101 


0.108 


0.115 


0.129 


0.156 


n > 100 1.02 n- 1 ' 2 


1.17 n-V2 


1.30 n- 1 / 2 


1.42 n -1 / 2 


1.56 rT 1 ! 2 



traditio nally called shadow price s, LT, which for this case correspond to the chemical po- 



tential [Warren and Jones. 20061 ] . In a previous study the shape of the distribution of the 
shadow prices has been analysed. From the set of tested PDFs the log-normal has proven 
to be the better description. 

Our first example is composed of shadow pri ces of the genome-scale model for E . 
coli (iJR 904) growing on a D-glucose substrate [Reed et a/.. 20031 . iKummel et al. 2006 . 
Reed and Palsson. 20071 ] . The number of shadow prices is 649. Minimisation of the log- 
likelihood function we obtained fi = —0.432, a = 0.838 and q = 1.21 in comparison with 
fi = —0.454 and a = 0.741 for the log-Normal. The corresponding Kolmogorov-Smirnov 
distances are 0.072 and 0.123, respectively, as we depict in FigJH Other qualitatively similar 
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TABLE II: Quantiles of the Kolmogorov-Smirnov statistics of a q-log-Normal distribution with 
q = 5/4, ll = and a = 1. 



n\P 


0.80 


0.85 


0.90 


0.95 


0.99 


5 


0.382 


0.413 


0.454 


0.513 


0.627 


10 


0.286 


0.307 


0.334 


0.377 


0.461 


15 


0.246 


0.262 


0.282 


0.317 


0.384 


20 


0.204 


0.218 


0.237 


0.277 


0.327 


25 


0.189 


0.202 


0.218 


0.246 


0.299 


30 


0.174 


0.186 


0.202 


0.225 


0.276 


35 


0.161 


0.174 


0.188 


0.213 


0.256 


40 


0.155 


0.165 


0.178 


0.204 


0.242 


45 


0.146 


0.156 


0.169 


0.189 


0.229 


50 


0.137 


0.148 


0.162 


0.183 


0.217 


60 


0.128 


0.138 


0.148 


0.165 


0.201 


70 


0.118 


0.127 


0.138 


0.154 


0.186 


80 


0.111 


0.121 


0.1301 


0.143 


0.175 


90 


0.107 


0.113 


0.122 


0.135 


0.164 


100 


0.099 


0.107 


0.115 


0.128 


0.155 


n > 100 1.01 n- 1 ' 2 


1.15 rT 1 ! 2 


1.28 n- 1 / 2 


1.41 n- 1 ' 2 


1.57 n- 1 / 2 



results, q > 1, are found for the shadow prices of models growing upon aerobic conditions. 

A different kind of distribution was obtained when a metabolic network like the M. 
barkeri (iAF 692 model) evolving in a Hydrogen medium was considered. In this case 517 
metabolites are taken into account. The values of the best fit obtained were fi = —0.633, 
a = 1.24 and q = 0.822 in comparison with fi = —0.454 and a = 0.741 for the log-Normal. 

These parameters yield the following Kolmogorov-Smirnov distances 0.049 and 0.080 
which represent a blunt improvement. Moreover, the introduction of the extra parameter q i s 
completely justified when we calculate the Akeike information criterion (AIC) Akeike. 1974| , 

~RSS 



AIC = 2 k + n In 



n 



where k is the number of parameters, n is the number of metabolites of the metabolic network 
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data 

log -Normal 
— q-log-Normal 



2 4 6 8 10 12 14 

n 

FIG. 4: Cumulative density function of the shadow prices vs shadow price of the metabolic network 
of the E. coli (iJR 904) growing on a D-glucose substrate. The symbols are obtained from the data 
and the lines the best fits with the q-log-Normal distribution and log-Normal. The values of the 
parameters and error are mentioned in the text. 

and RSS is the residual sum of squares. The values of AIC per metabolite are —6.288 and 
— 5.607 for the (/-log-Normal and the log-Normal in the case of the E. coli, respectively. For 
the M. barkeri the values are —7.474 and —6.682. This is clearly adduces that the q-\og- 
Normal outperforms the log-Normal distribution which had given the best results. From a 
biological perspective is even more appealing that metabolic networks developed in a aerobic 
environment have presented a value of q > 1 and networks related to anaerobic environments 
yield q values smaller than 1. Whence, we can infer that q value can be possibly used as a 
signature of aerobic and anaerobic growing. 



B. Volatility in financial markets 



One of the keystone elements of mathematical finance is the volatility. Despite appear- 



ing in eve ry theory o 



financial markets the truth is that volatility still lacks a precise 



definition [Engle. 19951 ] . Nonetheless, it is customarily associated with average of squared 
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1.0 








data 

log-Normal 
q-log-Normal 



n 



8 



FIG. 5: Cumulative density function of the shadow prices vs shadow 



price of the metabolic network 



Feist. 20061 ] . The symbols are 



of the M. barkeri (iAF 692 model) growing in Hydrogen medium 
obtained from the data and the lines the best fits with the q-log-Normal distribution and log- 
Normal. The values of the parameters and error are mentioned in the text. 

fluctuations, 

r A (t) =lnS(t + A)-lnS(t), 

of the (log-)price (or index) S over some window T (A is lag). It is well-known for a long 
time that price fluctuations are nicely fitted by the Student's t-distribution. An explanation 
for that relies on the local Gaussianity of the price distribu tions but wit h a time dependent 
variance as it has been hold by heteroskedastic processes [Engle. 19951 ]. In that sense, we 
can consider a variable B (t) —v(t)~, where 

W (t) = ify(t-*). (43) 

i=l 

Accordingly, the distribution of price fluctuations following a Bayesian approach 



p(r) = J P{(3)p{r\B) dB. 



Assuming the Student's t-distribution hypothesis for p (r) and 

~B 



p (r\B) 




exp[— B r 



(44) 



(45) 



19 



the distribution of B must be, 

In this case, we have analysed the distribution of B according to the definition given in 
Eq. ( 1431) using the daily fluctuations of the SP500 index from the 3 rd January 1950 
to the 3 rd April 2009 and considering 5-business days windows with data obtained from 



htt p : / /f inance . yahoo . com (see Fig. [6]). 



18- 
& 16- 



14- 




01/01/1950 01/01/1960 01/01/1970 01/01/1980 01/01/1990 01/01/2000 01/01/2010 

FIG. 6: Evolution of the five-day volatility of the SP500 index as defined in the text after normal- 
isation by its average value. Value above the dashed red line can be considered extreme events. 

Regarding P(B), we have actually noticed that the distribution is poorly described by 
Eq. (|46p . Conversely, we verified a very good agreement with Eq. (1291) as we show in Fig. [7J 
The values obtained for Pp (B) are k = 0.314 and 9 = 1.41 and for Vq,2-q (£>) we have 
jj, = 0.391, cr = 1.15 and q = 1.22. The values of the Kolmogorov-Smirnov distances yield 
0.0959 and 0.0126 which has passed the statistical test for a = 2%. A representation of 
the probability density function adjustment is shown in Fig. [7J Although not shown here a 
log-Normal adjustment which yielded \x = 0.379 and a = 1.121 and a Kolmogorov-Smirnov 
distance of 0.0177 which is 40% greater than the Kolmogorov-Smirnov distance of the q- 
Log-Normal. The utilisation of two values for q (although they relate one another) can 
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be understood if we ac cept tested hypotheses that t he volatility runs over two mechanisms 
(short and long scale) |Bouchaud and Potters. 20001 ] . Nevertheless, it is worth mentioning 
that applying Eqs. (I4"4"1) -( H6"1) with the values we have determined brings about a Student- 
t distribution with v = 2.64 that is i n accordance with th e value measured for the tail 
exponent of that distribution (see e.g. jDuarte Queiros. 20051 ] ) . 



o 0.500 

g 0.100 
0.050 



0.010 
0.005 

0.001 

0.1 1 10 100 

B 

FIG. 7: Probability density function of the 5-day volatility vs B . The symbols are obtained from 
the data and the lines the best fits with the Gamma distribution and the double sided q-log-Normal. 
The values of the parameters and error are mentioned in the text. 




VII. FINAL REMARKS 



In this manuscript we have introduced a new kind of generalisation of the log-Normal 
distribution which stem from a modification of the multiplicative cascade using a new type 
of algebra recently introduced in a physical context. This modification of the g-product, 
as shown in other cases, represents a way of describing a type of dependence between the 
variables. Accordingly, the new distribution differs from the classical log-Normal by a single 
parameter q which favours the right-side tail for q > 1, the left-side tail if q < 1 and recovers 
the traditional form when q — 1. We have made an extensive description of the distribution 
namely by defining the moments, the Fourier Transform we have purported a generator 
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of random numbers as well which yield the distributions we mentioned here. Using these 
random number generators we have depicted the construction of a P-value table for the 
Kolmogorov-Smirnov distance when the g-log-Normal distribution is assumed. Moreover, 
we have tested the distribution against real data of biological and financial origin. Both 
results have shown its usefulness and all the cases we have studied curiously present values 
of q or 2 — q close to 5/4. Concerning future work we can mention the modification of the 
two branched distribution to accommodate equal weights for / and /' as we have considered 
here, using different dual relations for the q parameters or parameters that are not related 
by any dual relation as well. This last approach corresponds to accepting different mixtures 
of dynamical or structural processes. It is obvious that such modifications augment the 
number of the parameters which might be plainly justified by usual statistical criteria. 
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